Introducing the Gab Hate Corpus: defining and applying hate-based rhetoric to social media posts at scale
Abstract
We present the Gab Hate Corpus (GHC), consisting of 27,665 posts from the social network service gab.com, each annotated for the presence of “hate-based rhetoric” by a minimum of three annotators. Posts were labeled according to a coding typology derived from a synthesis of hate speech definitions across legal precedent, previous typologies, and psychology and sociology, comprising hierarchical labels indicating dehumanizing and violent speech as well as indicators of targeted groups and rhetorical framing. We provide inter-annotator agreement statistics and perform a classification analysis in order to validate the corpus and establish performance baselines. The GHC complements existing datasets in its theoretical grounding and by providing a large, representative sample of richly annotated social media posts.
Similar resources
Detecting the Hate Code on Social Media
Social media has become an indispensable part of the everyday lives of millions of people around the world. It provides a platform for expressing opinions and beliefs, communicated to a massive audience. However, this ease with which people can express themselves has also allowed for the large scale spread of propaganda and hate speech. To prevent violating the abuse policies of social media pl...
Detecting Hate Speech in Social Media
In this paper we examine methods to detect hate speech in social media, while distinguishing this from general profanity. We aim to establish lexical baselines for this task by applying supervised classification methods using a recently released dataset annotated for this purpose. As features, our system uses character n-grams, word n-grams and word skip-grams. We obtain results of 78% accuracy...
Analyzing the Targets of Hate in Online Social Media
Social media systems allow Internet users a congenial platform to freely express their thoughts and opinions. Although this property represents incredible and unique communication opportunities, it also brings along important challenges. Online hate speech is an archetypal example of such challenges. Despite its magnitude and scale, there is a significant gap in understanding the nature of hate...
Surfacing contextual hate speech words within social media
Social media platforms have recently seen an increase in the occurrence of hate speech discourse which has led to calls for improved detection methods. Most of these rely on annotated data, keywords, and a classification technique. While this approach provides good coverage, it can fall short when dealing with new terms produced by online extremist communities which act as original sources of w...
Hate Me, Hate Me Not: Hate Speech Detection on Facebook
While favouring communications and easing information sharing, Social Network Sites are also used to launch harmful campaigns against specific groups and individuals. Cyberbullism, incitement to self-harm practices, sexual predation are just some of the severe effects of massive online offensives. Moreover, attacks can be carried out against groups of victims and can degenerate in physical viol...
Journal
Journal title: Language Resources and Evaluation
Year: 2022
ISSN: 1574-020X, 1574-0218
DOI: https://doi.org/10.1007/s10579-021-09569-x